Examinng Talker Effects in the Perception of Native and Foreign-accented Speech
نویسندگان
چکیده
Understanding the circumstances under which talker (and other types of) variability affects language perception represents an important area of research in the field of spoken word recognition. Previous work demonstrates that talker effects are more likely when the processing is relatively slow (McLennan & Luce, 2005). Given that listeners may take longer to process foreign-accented speech than native-accented speech (Munro & Derwing, 1995), talker effects should be more likely when listeners are presented with words spoken in a foreign accent than when listeners are presented with those same words spoken in a native accent. The results of two experiments, conducted in two different countries and in two different languages, are consistent with this prediction. Variability can slow recognition of written (Burgund & Marsolek, 1997) and spoken (Bradlow, Nygaard, & Pisoni, 1999) words, supporting theoretical positions with specific representations (Goldinger, 1998). However, variability does not always affect word recognition (McLennan & Luce, 2005), supporting theoretical positions with abstract representations (TRACE; McClelland & Elman, 1986). Consistent with the phonetic relevance hypothesis (Sommers & Barcroft, 2006), some types of variability are more likely to affect spoken word recognition (Bradlow, Nygaard, & Pisoni, 1998). Determining which types of surface and allophonic (Luce McLennan, & Charles-Luce, 2003; McLennan et al., 2003; 2005) variability are more likely to affect spoken word recognition is an important area of research. Since there is evidence for both abstract and specific representations, Luce and McLennan (2005) (also, Luce & Lyons, 1998) suggested that variability might be more likely to affect spoken word recognition at various points during processing. McLennan and Luce (2005) subsequently provided evidence that abstract representations are more likely early and specific representations are more likely later. These authors used a long-term repetition-priming paradigm in which they presented listeners with two blocks of spoken words (primes & targets). Target words were either repeated or new. Repeated words were either spoken by the same (match) or a different (mismatch) talker as prime words. Crucially, the magnitude of specificity (MOS) (i.e., the advantage for repeated words spoken by the same talker relative to a different talker) was more robust during later processing. MOS was significant in the slower (delayed shadowing; hard lexical decision), but not in the faster (speeded shadowing; easy lexical decision) tasks. The only difference between the two shadowing tasks was that participants in delayed shadowing were instructed to delay their response until a response cue appeared (150 ms after stimulus offset). The only difference between the two lexical decision tasks was that the nonwords were unwordlike (low phonotactic probability) in the easy task and wordlike in the hard task. MOS was statistically larger in the hard tasks. These time-course results provided the motivation for the current investigation. Because listeners take longer to recognize words spoken with a foreign accent (Munro & Derwing, 1995), the prediction based on the time-course hypothesis is that talker mismatches should be more likely to affect recognition when words are spoken with a foreign accent. Previous studies have examined variability using signal degradations that result in effortful processing and reduced accuracy (e.g., low-pass filtering, Church & Schacter, 1994; white noise, Goldinger, 1996). Studies by Goldinger (1996) and Luce and Lyons (1998) were among the first to report RT; previous studies focused on accuracy. One aim of the current study is to examine a milder and naturally occurring form of degradation in which accuracy is expected to be high and the main dependent variable is RT. Two recent studies provide additional motivation. First, Vitevitch and Donoso (2011) found more change deafness (inability to detect a talker change) in an easy than a hard lexical decision task. Second, Mattys and Liss (2008) found greater talker effects with dysarthric speech than with healthy speech. Both studies support the time-course hypothesis and the notion that slower processing results in greater sensitivity to talker changes. Both Vitevitch and Donoso (2011) and McLennan and Luce (2005) manipulated processing speed in the lab. To our knowledge, Mattys and Liss (2008) were the first to examine the time course of talker effects without slowing from lab manipulations or artificially degraded stimuli. According to the authors, 3we use the term naturally occurring degraded speech to refer to unedited speech stimuli produced by individuals who, for whatever reason, produce speech that is degraded relative to the speech produced by healthy, native VSHDNHUV ́ SDJH &RQsequently;, one motivation for the current study is to examine talker effects in another form of naturally occurring degraded speech. Foreign-accented speech is of particular interest because it falls within this definition of naturally occurring degraded speech and, unlike dysarthric speech, can be produced by healthy speakers.1 Experiment 1: English with foreign-accented speech Method Participants. Seventy-two participants from the CSU community were paid or received credit for a course requirement. Participants were right-handed (Edinburgh Handedness Inventory, Oldfield, 1971) native speakers of American English with no reported history of speech or hearing disorders. Materials. The stimuli consisted of the words and nonwords used in Mc/HQQDQ DQG /XFH¶V 005) Experiment 2, re-recorded in English by one male and one female native Spanish speaker, both of whom learned English as adults and spoke with a foreign accent. Stimuli were recorded in a sound-attenuated room, low-pass filtered at 10 kHz, and edited into individual files. The mean durations for the experimental words produced by the male (583 ms) and female (574 ms) did not differ, t (22) < 1.0, p = .79. Design. The design followed Experiment 2 of McLennan and Luce (2005). Two blocks of stimuli were presented. Half the stimuli in each block were spoken by each talker. Primes matched, mismatched, or were unrelated to the targets. The talker was the same in the match condition (e.g., bookmale, bookmale) and different in the mismatch condition (e.g., bookmale, bookfemale). Words in the unrelated condition were unprimed. Both blocks consisted of 24 trials (half nonwords). Prime consisted of 8 experimental words, 8 nonwords, and 8 control stimuli (4 nonwords). Targets consisted of 12 experimental words and 12 nonwords. Eight targets matched, 8 mismatched, and 8 were controls. All nonwords and unrelated stimuli were fillers. The focus of the manipulations and analyses is limited to experimental words. A 3 (Prime) X 2 (Talker) completely within-participants design was used. Across participants, each word appeared in every condition, but no participant heard more than one version of a word within a block. Procedure. Participants performed a lexical decision task in which they decided as quickly and accurately as possible whether the stimulus was a real English word or a nonword by pressing one of two buttons (word on the right; nonword on the left) on a SuperLab response box. Between blocks, participants worked on a filler task for approximately five minutes. Stimuli in both blocks were presented binaurally over Sony headphones. An iMac running SuperLab software (Cedrus Corporation, 2006) controlled stimulus presentation and recorded RTs, measured from stimulus onset to button press onset. If the maximum RT (5 s) expired, the computer recorded an incorrect response and presented the next trial. Stimulus presentation within each block was random. Results Following McLennan and Luce (2005), RTs less than 500 or greater than 2,500 ms were excluded (two RTs). Three participants were also excluded.2 Overall accuracy to the experimental words in the target block was 96%. A Prime X Target repeated measures ANOVA was performed on mean RTs to correct responses.3 The main effect of Prime was significant, F1 (2, 126) = 6.90, p = .001, MSE = 18,670.34, Kp 2 = .10; F2 (2, 22) = 6.36, p = .007, MSE = 3,399.00, Kp 2 = .37. Because the focus is on evaluating priming and talker effects, the comparisons of primary interest are between the match and control conditions (Magnitude of Priming, or MOP) and between the match and mismatch conditions (Magnitude of Specificity, or MOS). Match reaction times (RTs) minus control RTs served as the MOP. Match RTs minus mismatch RTs served as the MOS. There are other potential ways to calculate MOP, including (match plus mismatch)/2 minus control, or mismatch minus control. However, we chose to assess MOP on the basis of match minus control in order to be consistent with McLennan and Luce (2005) (as well as other similar studies). Also, inspection of the means in Tables 1 and 2 reveals that such alternative calculations of MOP would have led to the same conclusions overall, albeit somewhat weaker MOPs. As shown in Table 1, comparisons consisting of paired one-tailed t-tests revealed significant MOP and MOS, t1 (68) = 3.08, p < .001, &RKHQ;¶V G = .37; t2 (11) = 3.01, p = .01, &RKHQ;¶V G = .99, and t1 (68) = 1.84, p = .035, &RKHQ;¶V G = .22; t2 (11) = 1.34, p = .10, &RKHQ;¶V G = .40, respectively.4 The difference between the mismatch and control conditions was also significant, t1 (68) = 1.80, p = .038, &RKHQ;¶V G = .22; t2 (11) = 2.26, p = .022, &RKHQ;¶V G = .83. Discussion The results of Experiment 1 are consistent with the time-course hypothesis. Recall that McLennan and Luce (2005) did not obtain talker effects in the same easy lexical decision task (Experiment 2A). A combined ANOVA revealed that the Prime X Experiment (McLennan & /XFH¶V ([SHULPHQW $ ZLWK QDWLYH-accented speech; current Experiment 1 with foreign-accented speech) interaction was not significant, F < 1.0, MSE = 16,735.97, p = .658, Kp 2 = .003. Nevertheless, in addition to a statistically significant MOS effect in the current experiment (-28), and not in Experiment 2A of McLennan and Luce (2005) (-8), an independent one-tailed t-test revealed significantly longer RTs in the current experiment (900 ms) than in Experiment 2A of McLennan and Luce (2005) (773 ms), t (135) = 8.04, p < .01, &RKHQ;¶V G = 1.37, supporting the claim that foreign-accented speech slows processing, allowing specificity effects to emerge. However, an additional (two-tailed) t-test revealed longer stimulus durations in the current experiment (579 ms) than in Experiment 2A of McLennan and Luce (2005) (373 ms), t (23) = 11.07, p < .01, &RKHQ;¶V G = 3.20. Consequently, in order to investigate the relationship between foreignaccented speech and talker effects further, we conducted Experiment 2. The primary motivation for Experiment 2 was to provide a direct within-study comparison of talker effects as a function of accent. Half the participants heard words spoken by a native speaker, and half heard the same words spoken by a non-native speaker with a foreign accent. Furthermore, the durations of the nativeand foreign-accented experimental words were equivalent, allowing us to rule out a duration-based explanation. Experiment 2: Spanish with nativeand foreign-accented speech Method Participants. Seventy-two participants from the Universitat Jaume I (Spain) community were paid or received credit. Participants were right-handed native Spanish speakers with no reported history of speech or hearing disorders. Materials. All stimuli, shown in the Appendix, were recorded in Spanish by one male and one female native American English speaker with a foreign accent, and by one male and one female native Spanish speaker with a native accent.5 The stimuli were recorded, filtered, and edited as in Experiment 1. The mean word frequency for the experimental words was 981 per five million according to LEXESP (Sebastián-Gallés, Marti, Cuetos, & Carreiras, 2000). The mean durations for the experimental words produced by the native (580) and non-native (577) speakers did not differ, t (46) < 1.0, p = .857. Design. The design is identical to Experiment 1, with the exception of adding the between-participants factor Accent (native, foreign). Half the participants heard words and nonwords produced by the native Spanish speakers, and half heard the same stimuli produced by the native American English speakers in Spanish with a foreign accent. Procedure. The procedure is identical to Experiment 1, except the stimuli were presented over AKG-K55 headphones, and the experiment was controlled by Inquisit 1.33 software on a Pentium PC, which recorded RTs. Results No RTs were less than 500 or greater than 2,500 ms.6 Overall accuracy to the experimental words in the target block was 91%. A Prime X Target X Accent mixed factors ANOVA was performed on mean RTs to correct responses. As expected, native-accented words were responded to more quickly (785 ms) than foreign-accented words (981 ms), F1 (1, 60) = 48.64, p < .001, MSE = 85,023.36, Kp 2 = .45; F2 (1, 22) = 37.90, p < .001, MSE = 42,345.44, Kp 2 = .63. Again, the MOP and MOS are of primary interest. The crucial difference between the current experiment and Experiment 1 is our ability to directly evaluate talker effects in the nativeand foreign-accented conditions. The Prime X Accent interaction was marginally significant, F1 (2, 120) = 2.60, MSE = 21,995.14, p = .079, Kp 2 = .04; F2 (2, 44) = 1.04, MSE = 11,715.51, p = .362, Kp 2 = .05. Consequently, MOS and MOP were performed separately for the nativeand foreign-accented conditions, as shown in Table 2. In the native-accent, MOP was significant, t1 (35) = 1.85, p = .04, &RKHQ;¶V d = .31; t2 (11) = 2.95, p = .01, &RKHQ;¶V G = .86, and MOS did not approach significance, t1 (35) < 1.0, p = .38, &RKHQ;¶V G = .05; t2 (11) < 1.0, p = .44, &RKHQ;¶V d = .05. The difference between the mismatch and control conditions was also significant, t1 (35) = 2.17, p = .019, &RKHQ;¶V G = .37; t2 (11) = 2.30, p = .021, &RKHQ;¶V G = .74. In the foreign-accented condition, both MOP and MOS were significant, t1 (35) = 3.04, p < .001, &RKHQ;¶V G = .55; t2 (11) = 2.22, p = .02, &RKHQ;¶V G = .64 and t1 (35) = 2.39, p = .01, &RKHQ;¶V G = .41; t2 (11) = 1.00, p = .17, &RKHQ;¶V G = .25, respectively. The difference between the mismatch and control conditions was not significant, t1 (35) = 1.17, p = .126, &RKHQ;¶V G = .20; t2 (11) = 1.39, p = .096, &RKHQ;¶V G = .40. A critical final comparison consisting of an independent one-tailed t-test was performed in order to directly compare the MOS in the nativeand foreignaccented conditions. These results provided further evidence that MOS was greater in the foreign(-57 ms) than the native-accented (+4 ms) condition, t1 (70) = 2.24, p = .01, &RKHQ;¶V G = .53; t2 (22) < 1.0, p = .21, &RKHQ;¶V G = .34. Discussion The results of Experiment 2 are consistent with the time-course hypothesis. We are not arguing that talker effects are never expected in nativeaccented speech; such evidence already exists (McLennan & Luce, 2005). Rather, our argument is that talker effects are more likely when processing is relatively slow, and consequently, talker effects are more likely in foreignaccented speech. Although both experiments involved foreign-accented speech, the following data suggest that listeners were indeed accessing the intended lexical items. First, accuracy in the lexical decision task was quite high (96% and 91% in Experiments 1 and 2, respectively). Second, we collected additional data in order to address this issue directly. Ten new native speakers of American English at Cleveland State University were asked to identify each of the experimental words for the English stimuli (produced with a Spanish accent) and 10 new native speakers of Spanish at the Universitat Jaume I were asked to identify each of the experimental words for the Spanish stimuli (produced with an American English accent). The results of the English stimuli were as follows: The mean percentages correct for the stimuli produced by the male and female talker were 98% and 94%, respectively. Furthermore, the mean percentage correct for the experimental words was 96%. The results of the Spanish stimuli were as follows: The mean percentages correct for the stimuli produced by the male and female talker were 95% and 96%, respectively. Furthermore, the mean percentage correct for the experimental words was 95%. In short, for both the English and the Spanish stimuli, the foreign-accented words were intelligible across speakers and items. These data provide further evidence that the current results are not simply indicative of a decision under optimal conditions versus decision under uncertainty. Although many studies using degraded stimuli may result in relatively low accuracy, indicative of some greater degree of uncertainty, accuracy in the current experiments was quite high and RT is the main dependent variable. We performed one final analysis directly comparing the combined MOS from the two native-accented conditions (Experiment 2A of McLennan & Luce, 2005 & Experiment 2) and the two foreign-accented conditions (Experiment 1 & Experiment 2). The results of this one-tailed t-test revealed significantly greater MOS in foreignthan native-accented speech, t (207) = 2.05, p = .02, &RKHQ;¶V G = .28. General Discussion The current study demonstrates that talker effects are more likely in foreign-accented speech, consistent with the time-course hypothesis. The evidence is particularly strong given that we not only found greater MOS in foreign-accented speech in our between-study comparison (Experiment 1), but also in our within-experiment comparison (Experiment 2). However, because Clarke and Garrett (2004) have shown that listeners adjust to foreign accents quickly when presented with longer utterances (complete sentences rather than isolated words), the current pattern of results may be restricted to isolated word recognition. That is, if listeners typically adjust to foreign accents quite rapidly, then they may quickly revert to their default pattern of results in which talker effects are less likely to affect their perception of spoken language. Nevertheless, the role that talker-specific representations play when listeners are presented with longer utterances of foreign-accented speech remains an empirical question that should be addressed in future studies. The current study advances our understanding of the circumstances under which talker-specific details affect spoken word recognition (McLennan, 2006) by providing evidence of greater talker effects with foreign-accented speech. To our knowledge, this is the only published study examining the time course of talker effects when OLVWHQHUV¶ SURFHVVLQJ ZDV UHODWLYHO\ VORZ ZLWKRXW VORZLQJ IURP lab manipulations or artificially degraded or disordered speech. The current results support the use of the same theoretical framework in accounting for talker effects LQ OLVWHQHUV¶ SHUFHSWLRQ RI FOHDU VSHHFK DV ZHOO DV naturally occurring degraded speech produced by dysarthric speakers and healthy speakers with a foreign accent. Furthermore, the current results provide important new information beyond the results with dysarthric speech. Some researchers have discussed the role that attention may play in listeneUV¶ SHUFHSWLRQ H J 1\JDDUG DQG DFTXLVLWLRQ H J )UDQFLV Nusbaum, 2002) of abstract and more fine-grained acoustic-phonetic structure. Although we have interpreted our results in terms of the time-course hypothesis, both our results and the time-course hypothesis are compatible with an attentionbased account. The degree of task difficulty may affect the way listeners attend to the signal. When the task is easy, it may be sufficient for listeners to attend to only a few relevant phonemic distinctions in order to perform the task successfully. On the other hand, when the task is difficult, the listener may need to devote more attentional resources to a finer level of phonetic detail, which in turn results in more robust talker effects. Also, it may seem as though we are positing that talker-specific representations are qualitatively distinct from abstract representations, and that talker-specific representations are not playing any role until later. However, we are not arguing for either of these points. First, although our findings are consistent with qualitatively distinct representations, this is not necessarily the case. It is possible that abstract information and talker-specific details are part of a more distributed representation. Second, although our findings provide empirical evidence that abstract information and talker-specific details affect processing at different points in time, it is not necessarily the case that talkerspecific details are not playing any role until later. Rather than assuming that talker-specific representations (if qualitatively distinct) or talker-specific aspects of a distributed representation are not playing any role early (such that it takes longer for this information to become activated), it is possible that all sources of information play a role immediately, but some sources simply take longer for their effects to be detected. In this way, the time-course hypothesis is not necessarily positing that talker-specific information will play no role early, rather that the effects of talker-specific information will always play a larger role later during processing.7 One final point merits discussion. Although the role that surface information, including talker-specific details, plays in the perception of spoken words remains an important issue, researchers have only examined one of the two directions of these effects. Researchers have manipulated surface information, most frequently the talker, and examined the effect that this PDQLSXODWLRQ KDV RQ OLVWHQHUV¶ DELOLWy to recognize the linguistic information (the spoken words). However, the opposite direction remains relatively unexplored. Researchers could manipulate the linguistic information (e.g., high versus low frequency) and examine the effect that this manipulaWLRQ KDV RQ OLVWHQHUV¶ subjective perception of the surface information (e.g., the strength of a non-native VSHDNHU¶V IRUHLJQ DFFHQW 6KDK DQG 0cLennan (2008) have begun to investigate the effect that ease of lexical access (primed versus unprimed) has on OLVWHQHUV¶ accent ratings. Also, Nygaard and Queen (2008; also, Nygaard & Lunders, 2002) SURYLGHG HYLGHQFH WKDW HPRWLRQDO WRQH RI YRLFH FDQ DIIHFW OLVWHQHUV¶ SURFHVVLQJ RI the linguistic content of spoken words. Studies in which both directions in the relationship between linguistic and surface information are examined should lead to a more complete understanding of how listeners represent and process bothtypes of information. ReferencesBradlow, A. R., Nygaard, L. C., & Pisoni, D. B. (1999). Effects of talker, rate, andamplitude variation on recognition memory for spoken words. Perception& Psychophysics, 61, 206-219. Brouwer, S., Mitterer, H., & Huettig, F. (2011). Speech reductions change thedynamics of competition during spoken word recognition. Language and Cognitive Processes. Advance online publication. Burgund, E.D., & Marsolek, C.J. (1997). Letter-case-specific priming in the rightcerebral hemisphere with a form-specific perceptual identification task.Brain and Cognition, 35, 239±258.Cedrus Corporation. (2006). SuperLab Pro (4.0). San Pedro, CA http://www.superlab.com. Church, B. A. & Schacter, D. L. (1994). Perceptual specificity of auditory priming:Implicit memory for voice intonation and fundamental frequency. Journalof Experimental Psychology: Learning, Memory, and Cognition, 20, 521533.Clarke, C. M. & Garrett, M. F. (2004). Rapid adaptation to foreign-accentedEnglish. The Journal of the Acoustical Society of America, 116, 36473658.Francis, A. L. & Nusbaum, H. C. (2002). Selective attention and the acquisition ofnew phonetic categories. Journal of Experimental Psychology: HumanPerception and Performance, 28, 349-366. Goldinger, S. D. (1996). Words and voices: Episodic traces in spoken wordidentification and recognition memory. Journal of Experimental Psychology:Learning, Memory, & Cognition, 22,1166-1183. Goldinger, S. D. (1998). Echoes of echoes? An episodic theory of lexical access.Psychological Review, 105, 251-279. Lively, S. E., Logan, J. S., & Pisoni, D. B. (1993). Training Japanese listeners to identify English /r/ and /l/ II: The role of phonetic environment and talkervariability in learning new perceptual categories. The Journal of theAcoustical Society of America, 94, 1242-1255. Luce, P. A. & Lyons, E. A. (1998). Specificity of memory representations forspoken words. Memory & Cognition, 26, 708-715.Luce, P. A. & McLennan, C. T. (2005). Spoken word recognition: The challengeof variation. In Pisoni, D. B. & Remez, R. E. (Eds.), Handbook of SpeechPerception, pp. 591-609. Malden, MA: Blackwell.Luce, P. A., McLennan, C. T., & Charles-Luce, J. (2003). Abstractness andspecificity in spoken word recognition: Indexical and allophonic variability inlong-term repetition priming. In Bowers, J. & Marsolek, C. (Eds.), Rethinkingimplicit memory, pp. 197-214. Oxford: Oxford University Press. Mattys, S. L. & Liss, J. M. (2008). On building models of spoken word UHFRJQLWLRQ :KHQ WKHUH LV DV PXFK WR OHDUQ IURP QDWXUDO 3RGGLWLHV ́ DV artificial normality. Perception & Psychophysics, 70, 1235-1242. McClelland, J. L. & Elman, J. L. (1986). The TRACE model of speech perception.Cognitive Psychology, 18, 1-86. McLennan, C. T. (2006). The time course of variability effects in the perception ofspoken language: Changes across the lifespan. Language and Speech,49, 113-125.McLennan, C. T. (2007). Challenges facing a complementary-systems approachto abstract and episodic speech perception. Proceedings of the 16thInternational Congress of Phonetic Sciences, 67-70, Saarbrücken, Germany.McLennan, C. T. & Luce, P. A. (2005). Examining the time course of indexicalspecificity effects in spoken word recognition. Journal of ExperimentalPsychology: Learning, Memory, and Cognition, 31, 306-321.McLennan, C. T., Luce, P.A., & Charles-Luce, J. (2003). Representation of lexicalform. Journal of Experimental Psychology: Learning, Memory, and Cognition, 29, 539-553.McLennan, C. T., Luce, P. A., & Charles-Luce, J. (2005). Representation oflexical form: Evidence from studies of sublexical ambiguity. Journal ofExperimental Psychology: Human Perception and Performance, 31, 1308-1314. Munro, M. J. & Derwing, T. M. (1995). Processing time, accent, and comprehensibility in the perception of nativeand foreign-accentedspeech. Language and Speech, 38, 289-306. Nygaard, L. C. (2003). Perceptual stability and information variation: Acommentary on Remez, Goldinger, Azuma, and Local. Journal of Phonetics,31, 345-349. Nygaard, L. C. & Lunders, E. R. (2002). Resolution of lexical ambiguity byemotional tone of voice. Memory and Cognition, 30, 583-593. Nygaard, L. C. & Pisoni, D. B. (1998). Talker-specific learning in speechperception. Perception & Psychophysics, 60, 355-376. Nygaard, L. C. & Queen, J. S. (2008). Communication emotion: Linking affectiveprosody and word meaning. Journal of Experimental Psychology: Human Perception and Performance, 34, 1017-1030. Oldfield, R.C. (1971). The assessment and analysis of handedness: theEdinburgh inventory. Neuropsychologia, 9, 97-113. Pollatsek, A. & Well, A.D. (1995) On the Use of Counterbalanced Designs inCognitive Research: A Suggestion for a Better and More PowerfulAnalysis. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21, 785-794. 5DDLMPDNHUV * :$ IXUWKHU ORRN DW WKH 3Language-as-FixedEffect)DOODF\ ́ Canadian Journal of Experimental Psychology, 57, 141±151. Raaijmakers, J. G. W., Schrijnemakers, J. M. C., & Gremmen, F. (1999). How doGHDO ZLWK 37KH /DQJXDJH-as-Fixed-(IIHFW )DOODF\ ́ &RPPRQ; PLVFRQFHSWLRQVand alternative solutions. Journal of Memory and Language, 41, 416±426. Sebastián-Gallés, N., Martí, M.A., Carreiras, M., & Cuetos, F. (2000). LEXESP:una base de datos informatizada del español. Barcelona: Universitat de Barcelona.Shah, A. P. and McLennan, C. T. (2008). The role of foreign-accentedness inlexical processing. Proceedings of Meetings on Acoustics, 2, 1-13. Sommers, M. S. & Barcroft, J. (2006). Stimulus variability and the phoneticrelevance hypothesis: Effects of variability in speaking style, fundamentalfrequency, and speaking rate on spoken word identification. The Journalof the Acoustical Society of America, 119, 2406-2416. Vitevitch, M. S. and Donoso, A. (2011). Processing of indexical informationrequires time: Evidence from change deafness. The Quarterly Journal of Experimental Psychology, 64, 1484-1493. Appendix.Stimuli used in Experiment 2. The last four rows are for the unrelated fillers trialsused in the prime block only; the remaining rows are for the experimental trials. Words Nonwordssueño suetotirotizofuerza fuermatecho techaguerra guerrededodedasangre sangrigritogrimotiempo tiempipolvopolvipadre padranievenielejefejefihijohicoguardia guardiobosque bosca Author NoteConor T. McLennan, Department of Psychology, Cleveland StateUniversity. Julio González, Department of Basic, Clinical Psychology andPsychobiology, University Jaume I, Castellón, Spain.This research was supported by research grant number R03 DC 007316from the National Institute on Deafness and Other Communication Disorders, National Institutes of Health, and by the Ministry of Science and Innovation ofSpain, research grant PSI2009-10067 (subprogram PSIC), and BRAINGLOT(grant Consolider-Ingenio 2010).We thank Steven Beyer, Jaime Deyling, Alisa Maibauer, Teresa MarkisJessica Newell, Timothy Patrick, Maura Krestar, and several undergraduateresearch assistants for their assistance conducting Experiment 1. Correspondence concerning this article should be addressed to Conor T.McLennan ([email protected]), Cleveland State University, Departmentof Psychology, Language Research Laboratory, 2300 Chester Avenue,Cleveland, Ohio 44115; or to Julio González ([email protected]), Dpt.Psicología Básica, Clínica y Psicobiología. Universitat Jaume I. 12071-Castellónde la Plana, Spain. Footnotes1There are other theoretically important distinctions between accented anddysarthric speech, including the nature of the segmental distortion (accentedsegments may be more phonologically canonical than segments distorted bydysarthria), as well as intelligibility differences (dysarthric speech is likely to bemore difficult to perceive), and the frequency with which listeners are likely to be exposed to dysarthric (less often) and accented (more often) speech.2One participant was removed because no filler task was given to thisparticipant, due to experimenter error, and thus the target block beganimmediately following the prime block. A second participant was removedbecause their overall mean RT to correct responses during the target block wasgreater than two standard deviations above the grand mean. A third participant who reported difficulty hearing the stimuli was removed.3Two dummy variables representing allocation of participants toexperimental lists were included in the ANOVA solely to reduce the estimate ofrandom variation (Pollatsek & Well, 1995); effects involving the dummy variablesare not reported. Note that traditional item analyses are not appropriate for thecurrent experiments. Raaijmakers (2003; also, Raaijmakers et al., 1999) has argued that conducting separate item analyses in designs that usecounterbalanced lists is unfounded. Nevertheless, we report item analyses, morebecause of convention than because of their appropriateness. The reader shouldbear in mind these caveats in interpreting the significance levels of all item testsreported for the current studies. 4One-tailed tests are reported for tests with an a priori prediction about thedirection of an effect (e.g., faster RTs in the match than the mismatch condition).$OVR &RKHQ;¶V d was calculated for within-participant data using an online effectsize calculator. Cognitive Flexibility Laboratory (June 18, 2008). Effect sizecalculator. Retrieved from http://www.cognitiveflexibility.org/effectsize/ on July 7,2010. The typical effect size interpretations for &RKHQ;¶V d are .2 = small; .5 = medium; .8 = large.5Both of the Spanish speakers and all of the Spanish participants werenative speakers of Spanish (i.e., Spanish is their first and primary language).However, it is extremely difficult to find individuals in this area of Spain who aremonolingual speakers. Consequently, many participants and both speakers alsospeak Catalan, English, or both.6Three participants in the foreign-accented condition had errors on both ofthe trials in the control condition for the male talker. As a result, for these threeparticipants, there were no RTs to correct responses for that condition. &RQVHTXHQWO;\ HDFK RI WKHVH WKUHH SDUWLFLSDQW¶V PHDQV IRU WKHLU UHPDLQLQJ ILYH conditions was used to replace the missing value. Note this affected less than1% of the means from this experiment (i.e., 3 out of 432 condition means; 72 participants X 6 condition means for each participant).7There is evidence that blocked and intermixed presentations of casuallyand carefully produced speech can lead to a different pattern of results (Brouwer,Mitterer, & Huettig, 2011). Consequently, future investigations should examinewhether differences in specificity effects for nativeand foreign-accented speech are limited to the blocked design or would extend to a mixed design. Table 1.Reaction Times, Standard Errors (in parenthesis), and Magnitudes of Specificity(MOS) and Priming (MOP) for Experiment 1. Experiment 1M MM C MOS MOP English(with foreign-accented speech)870(13)898(13)931(16)-28* -61* M, MM, and C refer to the Match, Mismatch, and Control conditions, respectively.The * symbol is used to indicate significant effects in the MOS (match ±mismatch) and MOP (match ± control) planned comparisons. Table 2.Reaction Times, Standard Errors (in parenthesis), and Magnitudes of Specificity(MOS) and Priming (MOP) for Experiment 2. Experiment 2M MM C MOS MOP Spanish(with native-accented speech)777(21)773(24)806(28)+4 -29* Spanish(with foreign-accented speech)927(21)984(24)1031(28)-57* -104* M, MM, and C refer to the Match, Mismatch, and Control conditions, respectively.The * symbol is used to indicate significant effects in the MOS (match ±mismatch) and MOP (match ± control) planned comparisons.
منابع مشابه
Examining talker effects in the perception of native- and foreign-accented speech.
Understanding the circumstances under which talker (and other types of) variability affects language perception represents an important area of research in the field of spoken word recognition. Previous work has demonstrated that talker effects are more likely when processing is relatively slow (McLennan & Luce, 2005). Given that listeners may take longer to process foreign-accented speech than...
متن کاملPerceptual adaptation to non-native speech.
This study investigated talker-dependent and talker-independent perceptual adaptation to foreign-accent English. Experiment 1 investigated talker-dependent adaptation by comparing native English listeners' recognition accuracy for Chinese-accented English across single and multiple talker presentation conditions. Results showed that the native listeners adapted to the foreign-accented speech ov...
متن کاملThe Role of Spectral Resolution in Foreign-Accented Speech Perception
Several studies have shown that diminished spectral resolution leads to poorer speech recognition in adverse listening conditions such as competing background noise or in cochlear implants. Although intelligibility is also reduced when the talker has a foreign accent, it is unknown how limited spectral resolution interacts with foreign-accent perception. It is hypothesized that limited spectral...
متن کاملThe neural processing of foreign-accented speech and its relationship to listener bias
Foreign-accented speech often presents a challenging listening condition. In addition to deviations from the target speech norms related to the inexperience of the nonnative speaker, listener characteristics may play a role in determining intelligibility levels. We have previously shown that an implicit visual bias for associating East Asian faces and foreignness predicts the listeners' percept...
متن کاملPerceiving foreign-accented auditory-visual speech in noise: the influence of visual form and timing information
The present study examined the extent to which visual form and timing information assisted in the perception of native English and Japanese-accented English speech in noise. We also examined whether the degree of visual facilitation would be mediated by the talkers’ English experience. Thirty native Australian English listeners performed a speech perception in noise task with English sentences ...
متن کاملClear speech intelligibility: Listener and talker effects
In this study, we investigated whether the intelligibility-enhancing mode of speech production, known as “clear speech” produced by native and non-native talkers influenced speech intelligibility equally for native and non-native listeners. In a series of three experiments, we explored the effect of clear speech for various native and non-native talker and listener pairs. Combined, the results ...
متن کامل